A reanalysis of quasar absorption spectra results suggesting a spatial gradient in 

values of the fine-structure constant 
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We statistically analyse a recent sample of data points measuring the fine-structure constant a 
(relative to the terrestrial value) in quasar absorption systems. Using different statistical techniques, 
we find general agreement with previous authors that a dipole model is a well-justified fit to the 
data. We determine the significance of the dipole fit relative to that of a simple monopole fit, discuss 
the consistency of the interpretation, and test alternate models for potential variation of a against 
the data. Using a simple analysis we find that the monopole term (the constant offset in Aa/a) 
may be caused by non-terrestrial magnesium isotope abundances in the absorbers. Finally we test 
the domain-wall model against the data. 



I. INTRODUCTION 

Searching for possible variations in the fundamental 
constants of nature ahows us to test whether the laws of 
physics vary over space and time. There are many com- 
plementary probes that, taken together, span much of the 
Universe's history (see, e.g., the review IUzanll2003[ ) . Op- 
tical quasar absorption spectra provide a probe of vari- 
ations in the fine-structure constant, a — e^ /he, along 
a past-light cone centred on present-day telescopes. Nu- 
merous works employed comparison of alkali-doublets in 
space and in the laboratory to place limits on a- variation; 
these are easily interpreted since the energy separations 
are simply proportional to a^ . 

iDzuba et al\ (|l999al lbr) developed a different approach: 
the many-multiplet method, which exploits larger sensi- 
tivities to a-variation available in heavier ions and in 
transitions other than alkali- doublets. Moreover, the 
method allows for better control of systematics because 
there exist some negative shifters (with energy intervals 
that reduce with increasing a) as well as positive shifters 
and "anchor lines" (which have very weak sensitivity to 
a- variation) . In some cases both positive and negative 
shifters may be observed in the same ion. Additionally, 
the many-multiplet method sees a statistical gain because 
many more observed transitions may be used in the anal- 
ysis. 

During the last several years many absorption 
systems, observed using t he Keck t elescope in 
Hawa ii (iMurphv et al\ 12004 I2003L 120011: IWebb et al. 



199b) and tne Very Larg e ielescope m Chile (|King et al. 
2OI2I : IWebb et oLI [201ll) . have been analysed. Taken 



together, these provide measurements of Aa/a in ~ 300 
absorption systems covering most of the sky. Here 
Aa/a — {oL{r) — ao)/ao is the relative variation in a 
at a particular position r in the Universe where the 
absorption occurs. In IWebb et al\ ( 201ll ). the com- 
bined data sample is interpreted as providing evidence 
for variation in a throughout the Universe with an 
angular-dependence. This "dipole" model is found to be 
preferred to a monopole (constant offset) model of the 
variation at the A.la level. 



The values of Aa/a for each absor ber (presented in 
Murphy eta/.ll2004 lW ebb et a/j|201ll) are modelled in 
Webb et al\ (|201ll) (see also lKing eraril2012f) using sev- 
eral variants of a formula which can be written as 



Aa 



A + B{z)cose 



(1) 



where 6 is the angle between the direction of the mea- 
surement and the axis of the dipole, A is a constant (a 
"monopole" term) and B is the magnitude of the dipole 
term. The model variation comes from the form of the 
factor B, which in lWebb et al\ (|201l( ) was variously 

• B{z) = Bq, i.e. Eq. ([T]) represents pure angular- 
dependence without any distance-dependence; 

• B{z) = Bq r{z), where r = ct is the lookback time 
in giga-lightyears (Glyr); 

• B{z) = Bq z^ , here z is the redshift and /3 is an- 
other fitting parameter. 

The model of spatial variation that does not include a 
distance dependence is difficult to understand theoreti- 
cally, while fitting z" simply adds a poorly-constrained 
parameter to the data. Therefore in this paper we test 
the dipole interpretation B{z) ~ r. In this form ([T]) rep- 
resents a gradient in the value a throughout the Universe, 
and r cos 6 is the distance to a quasar absorption system 
along that gradient. 

Of course, r itself is model dependent at large red- 
shifts. In this work we use the stan dard AonM cosm ol- 
ogy parametrized by WMAP5 ( Hinshaw et aLll2009f) to 
determine r from the redshift z: 



<^) = TT 



1 



da 



Ho 7l/(l+z) Vf^mft ^ + f^A 



(2) 



Again, this reflects the fact that we are taking measure- 
ments along a past light cone centred on present-day 
Earth. An alternative approach is to use the comoving 
distance rather than lookback time: 



d{z) ^^ f 
^0 Ji/(i 



1 



da 



/(1 + 2) 



vn; 



r^A a 



(3) 



In this case the dipole model ([T]) is modified to use 
B(r) = BqcL^z). This parameterisation m.akes physical 
sense if one imagines that any spatial a-variation is 
"fixed" to the CMB frame and follows the same scale 
factor, a{t). 

This paper is organised as follows. In Section|llI]we de- 
velop criteria to assess the relative likelihood of different 
models of a- variation which we use throughout this work. 
After testing the normalcy of errors using a quantile- 
quantile plot, we use the modified Akaike Information 
Criterion to confirm that the preferred model of cosmo- 
logical a-variation is a dipole model with a monopole 
term. In Section ITVl we perform some additional statisti- 
cal tests to determine the significance of the dipole: an F- 
test and a modified error-ellipsoid method, described in 
detail in Appendix [X] To test the robustness of the data 
we perform biased and unbiased clippings of the data. 
In Section |V] it is shown that at least 40% of the quasar 
absorption systems would need to be removed from the 
sample to reduce the significance of the dipole to la. 

The monopole term of our preferred model of co- 
variation is somewhat difficult to reconcile with the re- 
quirement that 5a/ a should be zero locally, therefore in 
Section IVTl we test a hypothesis that changes in the rela- 
tive isotope abundances of magnesium could lead to the 
observed monopole term. We find that a relatively small 
increase in the abundance of ^^Mg relative to ^^Mg in 
the absorbers could account for the monopole term. Fi- 
nally, in Section FVlII we test whether the data supports a 
different model: that a "domain wall" separates regions 
of the Universe with different values of a (as proposed 
bv lOlive et aLl[201lh . Finding a best fit for this model 
requires a genetic algorithm, presented in Appendix |B1 



II. ASTRONOMICAL DATA 



divided into 'low-contrast' and 'high-contrast', defined 
as systems where both strong and weak lines are fitted. 
The 27 systems that constitute the high-contrast sample 
show large scatter in Aa/a values, which is attributed to 
weak components not being fitted when they're near the 
high optical depth edges of the strong transitions' profile. 
To this high-contrast sample is added an additional error 
frand — 1-743 X 10~^ which serves to reduce x^ per degree 
of freedom 'x^ /v to unity. The low-z and low-contrast 
high-z Keck samples do not require a dditional grand- 

The VLT/UVES data presented in iKing et al\ (|2012D 
also shows too much scatter in Aa/a. Quantifying this 
scatter depends on the underlying model assumptions, 
for example if one assumes that the dipole model is cor- 
rect (i.e. that there is a physical dipole in a that is being 
observed) and the data supports this, then drand should 
be smaller than it would be for a monopole model. The 
data we use includes errand = 0.905 x 10~^. This is a 
conservative (large) value which will tend to reduce the 
significance of the dipole relative to a monopole model. 
In practice all models give values of Gm nd in the same 
vicinity (see Table 2. of lKing et aLll2013 ) and the results 
of this paper are relatively insensitive to the exact firand 
used. 



III. INFORMATION CRITERIA FOR 
DIFFERENT MODELS 

In this work we use x^ ^■^ ^ likelihood measure for the 
various models. For measured values Xi = Aa/a and a 
model which takes values Xi = Xi{r) at the ith quasar, 
this is defined as the sum of squared residuals S,i '■ 



;ef=E 



yXi Xi) 



(4) 



Th e data use d in this paper are taken 

Murphy et all (|2004l ) (Keck data) and iKing et all 
(l20ld) (VLT), kindly supplied by the J. A. King in 



from 



a usable text format including the location of each 
absorber (redshift and direction) and its measured value 
of Aa/a. In total there are 293 points in our data file 
including 140 from Keck and 153 from VLT of which 
seven absorbers are seen in both the Kec k and VLT 
sampl es. The data is described fully in iKing etall 
( 20121 ). here we just note some points of particular 
interest to the analysis presented in this paper. Both 
samples account for unknown sources of scatter in the 
data by including extra systematic errors, CTrand, that 
are added in quadrature with the underlying statistical 
error. We will take each sample in turn. 

The Keck data is t aken from the 'fiducial sample' 
of iMurphv et all (|2004l ). This includes several samples 
of Keck/HIRES s pectra observed independ ently by dif- 
ferent groups (see lMurphv et aLll2004l2003l) for details). 
The combined data is divided into high-z (z > 1.8) and 
low-z subsamples, and the high-z subsample is further 



where ai is the observational uncertainty, x^ corresponds 
to the negative of the log-likelihood function for the 
normally-distributed random variable £,; smaller values 
of x^ correspond to better model fits. 

The normalcy of the residuals is an implicit assumption 
in using a x^ statistic. The validity of such an assump- 
tion may be ascertained by way of a quantile-quantile, 
or Q-Q, plot, which compares the residuals of a data set 
against the quantiles of a given distribution, or another 
sample. In addition to determining whether a data sam- 
ple is consistent with a given distribution, a Q-Q plot is a 
useful graphical method of comparing the qualitative fea- 
tures, such as skew, of samples of data against statistical 
distributions. 

We compare the residuals from the best fit dipole di- 
rection (light-travel distance) against a normal distribu- 
tion in Figure [TJ If our residuals are normal, then we 
expect the plot to lie approximately across the diagonal. 
Our comparison, Figure [TJ suggests that the residuals 
are near-consistent with a normal distribution, though it 
should be observed that there is some 'arching' in the 



curve, with the curve being below the diagonal near the 
tails and above near the centre. This indicates a negative 
skewness, that is, a long left-hand tail, in the residual dis- 
tribution. However, in light of the relatively small size 
of our sample, the slight skew we observe does not nec- 
essarily indicate that the residuals are not normal. We 
conclude that x^ is a reasonable statistic for our data. 

The addition of free model parameters on any data set 
can only reduce x^, therefore one must seek further jus- 
tification for their inclusion. The modified Akaike Infor- 
mation Criterion (AICc; the modification is that it has 
been corrected for small sample sizes) penalises model 
parameters, allowing us to compare the goodnes s of a y^ 
fit fo r differen t models. Th e AICc is defined as (jSugiural 
119781: see also lLiddlel[2007h 



AICc 



2 ^ 2p(p+I) 
n — p — 1 



(5) 



where p is the number of free parameters in our model 
and n is the number of data points in the sample. The 
preferred model is the one with the lower AICc. A sug- 
gested in terpretat i on sc ale for the AICc is that intro- 
duced by iJeffrevd (|l96lh , where a relative difference in 
the AICc of two models of < 5 indicates that the two 
models are almost equivalent in terms of significance. 

In TableUwe present the results of our AICc goodness- 
of-fit test of various a- variation models to the data. On 
the basis of these results we find: the Keck sample ex- 
hibits a preference for the monopole fit; the VLT data 
is better fit by a dipole model; and the combined sam- 
ple gives lowest x^ for the dipole -I- monopole model and 
the two-value model. In the next section we present re- 
sults from different statistical tests that suggest that the 
dipole model is preferred even for the Keck data subset. 
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FIG. 1. Quantile-quantile plot for all A*' = 293 data points. 
Coordinate yi: ordered residuals, ^i, of the data. Coordinate 
Xi: empirical estimate of the location of the i*'' A^-tile of the 
normal distribution, i.e. CDF(xi) ~ i/N where CDF(a;) is the 
cumulative distribution function of the normal distribution. 



TABLE I. Goodness of fit test using x^ /u {u — n — p is the 
number of degrees of freedom) and the corrected Akaike Infor- 
mation Criterion (AICc) for various models of the form ([l}. 
Models with a dipole term are calculated using both light- 
travel distance, r, and comoving distance, d. The "two-value" 
model allows separate monopole values for Keck and VLT. 



Model 


P 


2 

X 


xV'^ 


AICc 


Combined sample: 










Null (Aq/q = 0) 





310.439 


1.0595 


310.439 


Monopole 


1 


303.77 


1.0403 


305.784 


Dipole r 


3 


284.628 


0.9815 


290.711 


Dipole d 


3 


285.185 


0.9834 


291.268 


Dipole r + monopole 


4 


279.66 


0.9677 


287.799 


Dipole d + monopole 


4 


279.845 


0.9683 


287.984 


Two- value 


2 


282.585 


0.9711 


286.626 


Keck (14-0 data points): 










Null 





157.898 


1.1278 


157.898 


Monopole 


1 


132.799 


0.9554 


134.828 


Dipole r 


3 


142.439 


1.0397 


148.615 


Dipole d 


3 


143.611 


1.0482 


149.787 


Dipole r + monopole 


4 


131.28 


0.9653 


139.576 


Dipole d -f- monopole 


4 


131.309 


0.9655 


139.605 


VLT (153 data points): 










Null 





152.541 


0.9970 


152.541 


Monopole 


1 


149.786 


0.9854 


151.812 


Dipole r 


3 


139.908 


0.9327 


146.069 


Dipole d 


3 


138.839 


0.9256 


145.000 


Dipole r + monopole 


4 


139.288 


0.9348 


147.558 


Dipole d 4- monopole 


4 


138.167 


0.9273 


146.437 



For the full (combined) sample, the best AICc re- 
sult (though not the best x^) comes from the two- 
value model when ^Kcck = (-0.572 ± 0.114) x 10~^ and 
AvLT = (0.208 ± 0.125) X 10"^ According to the Jef- 
freys criterion, it is therefore of similar significance to 
the dipole-|-monopole model. The two- value model is ex- 
pected if there exist unknown systematics that cause ap- 
parent shifts in a, and these 'intra-telescope' systematics 
are different for the two telescopes. However we will ar- 
gue that the two-value result is not in conflict with the 
dipole result; it is actually expected given the distribu- 
tion of quasars seen with Keck and VLT. 

Let us first discuss the idea that there are unknown 
intra-telescope systematics that cause spurious shifts in 
the observed value of Aa/a. What kind of systematic is 
required? Any linear transform (offset or scaling) of the 
frequency scale will not mimic a-variation. The many- 
multiplet method uses many lines for the analysis: some 
are positive shifters, some are negative shifters, and some 
are anchors. In order to mimic a-variation it is therefore 
necessary for a systematic to shift lines in different direc- 
tions. One suggestion is that intra-order systematic shifts 
in calibration of the Keck spec trograph could mim ic a- 
variation in any one absorber (JGriest et a/.! 120101 ): it is 
less clear that this would have a non-zero mean when 
averaged over many absorbers. S imilar intra-order shift s 
have also been found in the VLT ( Whitmore et a/.ll2010t ). 
but it's worth noting that these are smaller in amplitude 



FIG. 2. A histogram of the distances, r = ct, to each absorber 
projected along the dipolar axis. The bars with positive- 
sloped lines on the left are from the Keck sample, while the 
bars with negative-sloped lines on the right represent the VLT 
data. It is clear that the two telescopes sample different por- 
tions of the sky. 




than the Keck shifts. 

The proposition that intra-telescope systematics can 
mimic a-variation can also be tested using the subset of 
quasar absorption systems that have data from both VLT 
and Keck telescopes. There are seven such systems, and a 
sophisticated analysis of them shows n o systematic veloc - 
ity offsets between the two telescopes (jWebb et aLll201l[) . 
Furthermore, the dipoles fit separately from the two tele- 
scopes point in the same direction (within la errors). 
This is entirely unexpected if systematics accounted for 
monopole offsets for the telescopes: the probability that 
the dipoles would point in the same direction by chance 
was estimated at 6%. A similar coincidence in the dipole 
directions was found when the sample was split in to high 
{z > 1.6) and low redshifts (|Webb ei aLll201l[ ): these 



subsamples use very different transitions i n different ions 
with different responses to Aa/a (see, e.g. lMurphv et al\ 
l2003l) . 

An alternative explanation to intra-telescope system- 
atics is that the two- value result is simply an artefact due 
to the two telescopes sampling data from significantly 
different portions of the sky. If each telescope had full 
coverage of the sky, then one would expect that averaging 
over all absorbers would give back the monopole term of 
Aa/a. However, the telescopes do not each have full sky 
coverage. Figure [5] shows the number of quasar absorp- 
tion systems seen by each telescope against the distance 
to each absorber projected along the dipolar axis. The 
two telescopes gather data from different parts of the sky 
(this is simply due to their locations on the Earth) . If the 
fitted dipole-l-monopole model represents the true distri- 
bution of a in the Universe, then it is to be expected that 
a weighted mean over the observed values yields different 
values for the two telescopes. 



TABLE II. Significance of the dipole-|-monopole model using 
lookback time r as the distance measure for the Keck sample, 
the VLT sample, and the combined data. Significance is ob- 
tained using both the F-test and the error-ellipsoid method 

(EEM). 



Keck VLT Combined 

i^-test (a) 0.43 2.50 4.21 
EEM (a) 0.55 2.37 4.09 



IV. DIPOLE SIGNIFICANCE 

Additional statistical tests are available for distin- 
guishing between alternate "nested" models. Two mod- 
els with pi and p2 parameters {pi < P2) are nested 
if the entire parameter space of model 1 is contained 
within that of model 2; this is obviously the case for 
the 4-parameter dipole-|-monopole model and the single- 
parameter monopole model, x^ must improve when the 
three additional dipole parameters are included. The -F- 
test provides us a method to find the probability that this 
improvement is due to chance. We define the F statistic 
by 



F 



{x\-xl)/{P2-pi) 
Xl/{n-p2) 



P2 > Pi 



(6) 



where n is the number of data points. Under the null 
hypothesis this statistic has an F-distribution; we can 
infer the probability that the improvement in x^ is due 
to chance by 



P= 1 



F{x; p2~pi,n- P2)dx . 



(7) 



We express this as a standard deviation a in the usual 
way, 1 — P = erf(cr/-\/2) where erf is the Gauss error 
function. The significance of the dipole for the Keck, 
VLT, and combined data samples is presented in Tablellll 
We again find that the Keck sample shows no preference 
for a dipole model, the VLT sample shows a significant 
preference for a dipole, and the combined sample shows 
a strong preference for a dipole model. 

Another method of assessing the significance of the 
dipole model is to simply calculate the usual statisti- 
cal significance of the dipole parameters relative to zero. 
We use a maximum likelihood analysis to find the pre- 
ferred direction and magnitude of the dipole. An er- 
ror ellipsoid is generated representing the uncertainty in 
the three-dimensional parameter space of the dipole, and 
the proportion of the best-fit dipole that is contained 
within this error ellipsoid indicates the significance of 
the dipole mo del. Thi s method was used, for exam- 
ple, by Cooke and Lvnde n-Bell (2010) to assess evidence 
for a dipole in the accelerated expansion of the Uni- 
verse. In Appendix |X] we present an extension of this 
method, which we refer to as the error-ellipsoid method 
(EEM) , to allow for distance-dependent dipole models of 
the type (P) . Table HI] shows that the EEM gives values 



for the significance of the dipole+monopole model rel- 
ative to the monopole-only model which are consistent 
with the F-test method for all data sets. 



V. ROBUSTNESS OF THE DIPOLE MODEL 

We wish to test whether the dipole result is due to a 
small proportion of the data sample. To do so we per- 
form biased and unbiased clippings of the data. We start 
with a biased iterative clipping method, where at each 
increment we remove a point that lends support to the 
dipole model. One option is to remove the absorber pro- 
ducing the smallest residual value ^. However, the data 
points with small residuals may be absorbers lying near 
the plane orthogonal to the fitted dipole axis, and these 
would not contribute to the dipole effect. Therefore we 
introduce an angular weighting that further biases the 
clipping towards removal of points that lie on the dipole 
axis: 



cos 9i 



Ui cos 9i 



where the Xi is the dipole model value for that absorber 
and 9i is the angle from the dipole axis. 

After each clipping of the absorber with smallest 
weighted residual, we calculate the significance of the 
dipole using the AICc, F-test and EEM. We present the 
result of these tests in Figure |3l The solid black line gives 
the significance of preference for the dipole-f monopole 
model over the pure monopole model as calculated via 
the F-test. This test indicates that the dipole model is 
preferred over the monopole (at la significance or more) 
until approximately 120 of the strongest points, or 40% 
of the data, is removed. The significance as calculated 
using EEM is given by the dashed line of Figure |3l we 
see that the consistency between significance assessed us- 
ing F-tests and EEM established in Section IIVI extends 
to clipped subsets of the data until ^120 points have 
been removed and the significance is smaller than la. 
The vertical dashed line indicates the point at which the 
dipole-|-monopole model is no longer preferred over the 
monopole model as determined by the AICc. Again, this 
occurs after '^ 120 points have been removed. 

The results of our biased clipping suggest that approx- 
imately 40% of the data that best supports the dipole 
must be removed before the significance of the dipole is 
reduced to Icr. Therefore, we expect that if we randomly 
remove 120 data points from the sample and re-fit our 
data, the dipole-|-monopole model would still be preferred 
(with at least la significance) over the monopole model. 
In Figure |4] we present the probability density function 
for this unbiased clipping performed 10 000 times. Each 
time, the dipole significance is calculated using an F-test. 
We find that, as expected, the significance is at least la in 
almost all 60% subsets of the data. Taken together with 
our biased clipping, this shows that the dipole result is 
not due to a small number of outliers in the sample. 



FIG. 3. Significance of the preference for a dipoIe-|-monopole 
model over the monopole model versus number of data points 
clipped. At each step we have removed the absorber with 
smallest weighted residual ^/cos8, where 8 is the angle be- 
tween the location of the absorber and the dipole axis. Solid 
line: significance assessed using F-test; dashed line: signifi- 
cance assessed using EEM. The vertical dashed line indicates 
the point at which the dipole model is not preferred according 
to the AICc. 
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FIG. 4. Probability density function for the significance of the 
dipole after random removal of 40% of data, assessed using the 
F-test. We see that in almost all cases the dipole-|-monopole 
model is preferred to the pure monopole model at more than 
la significance. 
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It is useful to cor itrast our biased i terative clipping 
to that presented in IWebb et al\ (J201l[ ). In that work, 
the point with largest residual was removed at each step 
of the procedure. This is useful for showing that the 
dipole is not caused by a few outlying data points, but 
it can also remove absorbers that do not suppo rt the 
dipole model. For example, in lWebb et al\ ( 201l[ ) there 
is no weighting by cos 6, and deviant points that lie in 
the perpendicular plane may be removed. Therefore it is 
unsurprising that the s ignificance of t he dipole increases 
at first (Fig. 4 in W ebb et a/.ll20lil) . However, there 
is a trade-off: the significance eventually falls as more 
points are removed because there are simply fewer data. 



I Webb et all ( 201 it) found that approximately 60% of data 
with the largest residuals had to be discarded before the 
significance dropped below 3a. 



VI. MAGNESIUM AND THE MONOPOLE 



The procedure of IWebb et all (|201ll) is to simultane- 
ously fit the redshift z and Aa/a (along with column 
densities, Doppler widths, etc.) for the entire quasar 
spectrum. The monopole term in the full data set, 
Aa/a — —0.19 x 10~^, corresponds to a shift in the Fen 
lines of 



We have seen from our AICc analysis (Section |III[) that 
there is some justification for including the monopole 
term of ^ as well as the dipole term. One natural 
interpretation of such a term is time-variation of the 
fine-structure constant, of the kind that was in dicated 
in previous Keck studies ( Murphv et all 120031 ). One 
would then expect the monopole term to have an explicit 
time-dependence (or, equivalently, redshift-dependence 
A = A{z)), although the form may not be resolved by 
the data. 

Another possible explanation for a monopole could be 
chemical evolution of the Universe, which changes the 
isotope abundance ratios. Since terrestrial abundances 
are assumed, any deviation from these ratios in the ab- 
sorber will shift the centroid of the line profile, and this 
might mimic a change in a. Accounting for a system- 
atic such as this is difficult, especially since the isotope 
shift is unknown for many of the lines used in the analy- 
sis. The transition with the largest known shift used in 
the analysis are the AA2796 and 2803 lines in Mg ii, and 
calculations suggest that th e unknown isotope shifts are 
smaller and less important ( Kozlov et alluOM\ . 

We have applied a simple test to see whether the iso- 
tope abundance ratios of magnesium could cause the 
monopole. We have simply removed all data points taken 
from absorbers where the Mg ii lines are present and used 
in the analysis. 113 absorbers remain from our initial 
sample of 293: our statistical significance is hugely re- 
duced. The removal of points with Mg ii is by no means 
a random sampling: these Mg II lines are seen in low red- 
shift systems since at z > 2 they are redshifted outside 
the range of optical telescopes. 

The results of our fitting are shown in Table IIIII For 
models that include a dipole we use the light-travel time 
as a measure of distance B{z) = Bor{z). We see that the 
best-fit dipole parameters for the Mg ii-removed data are 
consistent with those of the complete data set, but we no 
longer have a statistically significant monopole. Using 
an _F-test we find that the dipole-|-monopolc model is 
preferred over the monopole model at 1.9a significance 
for the Mg ii-removed set. 

We may try to estimate the magnitude of the change 
in Mg II abundances that could mimic the observed 
monopole term in the original data set. The two Mgii 
lines (at ~ 35000 cm~^) are often observed simultane- 
ously with Fell lines (~ 40000 cm~^) which in these sys- 
tems will provide most of the sensitivity to a- variation 
(g ~ 1500 for positive shifters in Fell). To obtain our 
order-of-magnitude estimate we assume here that only 
Mgii and positive-shifting Fell lines are present in the 
system. 



Aw 

UJ 



2qAa 



1.4 X 10" 



Fell 



This shift manifests as an effective change in the redshift 
of the quasar absorption system as measured by the Fe ii 
lines, but not as measured by the Mgii lines. It is pos- 
sible to obtain consistency in measured z if the relative 
isotope abundances of Mgli are assumed to vary. The 
isotope shif t of the Mgii lines is Awj,'? = w^^ — w^'' — 
0.102 cm^^ ( Drullinger et a/.lll980l ). In order to compen- 
sate the monopole term of the observed a-variation, the 
relative abundance of x =^^ Mg/^^Mg would have to 
change by 



Alois 



UJ 



1.4 X 10"^ 



Mgn 

X « 0.05 



That is, a roughly 5% increase in the relative abundance 
of ^^Mg could remove the observed monopole in Aa/a. 
Alternatively, a ^ 10% increase in the ^^Mg abundance 
relative to ^^Mg, would also work - or any combination 
of the two (e.g. the absorber may have the terrestrial 
25]y[g .26 ]y[g j-g^^jo^ and a reduction in ^"^Mg relative to 
the heavier 25,26]y[g isotopes). 

Of course our assumptions are quite rough. How- 
ever a previous study of the effect of isotope abun- 
dance in_the_j^eck_data^aine to much the same conclu- 
sion (JMurphv et aLll20M ). In their method, the quasar 
absorption spectra were refitted using a different value 
of magnesium heavy-isotope abundance and the Aa/a 
extracted were averaged assuming a monopole model. 
This found a linear relationship between their assumed 
25,26]y[g^24-|y[g j-^tio and the extracted Aa/a. A change 
in Aa/a of 0.2 x 10~^ required an increase in the abun- 
dance of heavy M g isotopes relative to ^^Mg of around 
10% (see Fig. 6 of lMurphv et aLll2004[ ). A similar anal- 
ysis applied to the z < 1.6 subsample of the Keck-|-VLT 
data also found that the monopole could be removed by 
a change in the ^'^'^^Mg/^^'Mg abundance ratio from the 
terrestrial valu e of 0.21 to a heav y-isotope enhanced value 
of 0.32 ± 0.03 (|King et aLll2"oTl) . 

Generally other species are present in our quasar ab- 
sorption spectra, and this could remove the degeneracy 
between variation of magnesium isotope abundance and 
variation of a. A more complete analysis should allow 
Mg isotope abundances to vary in the fitting procedure, 
which may be possible if this is restricted to only one 
additional parameter in the entire sample of ~ 300 ab- 
sorption systems. 



TABLE III. Best fit parameters and values of x^ and '}^ jv for different model fits to the data when all absorbers with Mgll 
have been removed. In models that include a dipole, the direction of the dipole axis is specified using right ascension (RA) and 
declination (Decl.) in equatorial coordinates. 



Model 



Parameter values 



X 



xV^ 



Subset with Mgll data removed: 
Null Aa/a = 

Monopole Aa/a = A 

Dipole Aa/a — Bor cos 



Dipole + monopole 



Aa/a — A + Bo r cos t 



yl = 0.16(15) X 10"^ 

Bo = 0.105 (35) X 10"^ Glyr-i 

RA = 16.0 (1.8) hr, Decl. = -63 (12)° 

^ = 0.12(15) X 10"^ 

Bo = 0.103 (35) X 10" 

RA= 15.8(1.8 ) hr, " 



^ 'Glyr-^ 
Decl. = -61(12)'^ 



126.565 
125.421 
117.491 

116.863 



1.1200 
1.1198 
1.0681 

1.0721 



All data: 

Dipole + monopole 



Aa/a = A + Bo r cos t 



A = -0.19(8) X lO"'^ 

Bo = 0.106 (22) X 10"^ Glyr" 

RA= 17.4(1.0) hr, Decl. = - 



62(10)° 



279.66 



0.9677 



VII. DOMAIN WALL MODEL 

Recently a differen t model for spatia l variation of a 
has been proposed bv lOlive et all ( 201l[ ) where the Uni- 
verse is divided into two domains, each with a different 
value of a. On the Earth-side of the domain wall a takes 
the terrestrial value, while on the other side it takes a 
different value. We can parametrize this model by the 
equation 



Aa \ A, d cos 9 — dwaii > 
a I otherwise 



(8) 



where dwaii is the shortest distance to the wall (we use 
comoving distance, Eq. ([3]), in this section) and 9 is the 
angle between the direction of the shortest distance to 
the wall and a quasar. 

Again, values for dwaii and the direction of the wall are 
chosen to minimise x^ ■, however in this case x^ is discon- 
tinuous with respect to changes in the parameters: any 
given quasar must either be on this side of the wall (model 
value Aa/a = 0) or the other (model value Aa/a — A) . 
This necessitates the use of a genetic algorithm to de- 
termine the minimum values of x^; our algorithm is pre- 
sented in Appendix |B] 

The best fit model for the entire data sample was 
found to have A = -1.055 x 10-^ dwaii = 5.513 Gly, 
RA = 20.1 hr, Decl. = 68°. The model has 4-parameters 
with x^ — 281.76; this can be compared with the 4- 
parameter dipole-|-monopole model with comoving dis- 
tance, for which x^ = 279.85. However it is worth noting 
that this minimum x^ is obtained for an extremely nar- 
row range of parameters, and even a small deviation in 
any of them increases it dramatically. For example, in 
Figure [5] we present x^ in the direction of best-fit (solid 
line) as a function of dwaii • 

The best-fit wall is located at an angle 47° to the dipole 
axis (towards negative values of Aa/a, i.e. 180° from the 
directions presented in Table HlH) . This is not consistent: 



distance to the wall, dwaii- Solid line: in the direction of the 
best fit for the wall (the best fit occurs at dwaii = 5.513 Gly); 
dashed line: in the direction of the dipole axis (decreasing a 



direction) . By comparison 
model is 279.66. 
312 



X for the spatial gradient (dipole) 




if the Universe really does have a domain wall in values 
of a one would expect a dipole fit to the data to have 
the same direction. If we calculate x^ as a function of 
wall distance parameter dwaii along the direction that is 
given by the best-fit dipole we obtain the dashed line 
in Figure [5j It is seen that the behaviour is much as 
would be expected if the dipole model is correct: as dwaii 
increases, fewer absorbers are found on the other side of 
the wall, the model value Aa/a = for more absorbers, 
and x^ increases. 



VIII. CONCLUSION 

We have developed a number of statistical tests to use 
quasar absorption data to differentiate between possible 
models of cosmological a-variation. We confirm that a 
model of variation including both dipole and monopole 



terms is preferred, however the monopole may be an arte- 
fact due to non-terrestrial magnesium isotope ratios in 
the absorbers. A small change (approximately 5%) in the 
ratio of ^^Mg to ^''Mg could account for the monopole, 
leaving the data to be explained by a purely spatial gra- 
dient in values of a across the Universe. 

Our robustness tests on the data showed remarkable 
consistency between various statistical tests: at least 40% 
of the data must be deleted in a heavily biased way to 
reduce the significance of the a-dipole to Itr. An unbiased 
removal of 40% of the data leaves the dipole significance 
at around 3 ± Ict. This confirms that the original result, 
that there is evidence for a spatial gradient in values of a 
at the 4.1(7 level, is not due to only a few deviant points. 

Finally we developed a genetic algorithm to find best- 
fit parameter s for the domain w all model of a-variation 
suggested bv lOlive et al\ (|201l[ ). We find that the best 
fit direction of the wall is not consistent with the dipole 
direction. This is because ^ in the best-fit region is 
anomalously low with respect to its local neighbourhood 
of parameter space. On the other hand if we force the 
domain wall to be perpendicular to the best-fit dipole 
direction, the resulting ^ are significantly larger than 
those of the dipole model. 
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For a dipole model, Xi is some function on the dipole 
direction and magnitude, which we represent as L. Max- 
imising the likelihood with respect to the model param- 
eter gives the expression 

A L = V. 

The covariance matrix for L is given by A~"^, which 
will depend on the various models as follows: 



Xi 



Xi 



m + B cos 9 


Xi = m(l +li -L) 




|L| ^ B/m, 






- Br cos 6 


Xt ^rli -L 




\L\^B, 




^2 



(ii<E)U). 



O^ -K Gi 



d,). 



Xi = m + Br cos 9 Xi = mr{l/r -\-li ■IS) 
|L| =i?/m. 
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Appendix A: Assessing the dipole significance using 
error-ellipsoid method 

In order to present a comparison of the significance of 
the models described above, we employ a similar strategy 
to that taken bv ICooke and Lvnden-Belll (|2010f ). gener- 
alised for our distance-dependent dipole models. The 
probability that we observe data Xi = {Aa/a)i given our 
model value at the ith quasar, Xi, is 



Pr(a;i|a;j) 



VM^ 



exp 



-{Xi 



2(ct2 



(Al) 



where ai is the observational error and a represents an 
intrinsic scatter of the true values from our model values 
Xi- The best fit model parameters can then be found so 
that they maximise the log-likelihood function 



C^ln 



Y[Pt{x,\x,) 

i 



_\ _ {X; - Xjf 

1 2(a2-Ha2) 



(A2) 



where m is the monopole magnitude, B is the dipole 
magnitude, r is the distance to the absorber and 9 is the 
angle between direction of the dipole and the absorber. 
We have taken a direct computational approach to de- 
termining the best fit model parameters. The probability 
that a dipole is preferred over the monopole, and hence 
the significance of the dipole result, is then determined as 
with the method of Cooke and Lynden-Bell. Given the 
covariance matrix A~^, we have that the semiprincipal 
axes of the error ellipsoid for L are Sj = ^/Xici, where 
Ai and e,; are the eigenvalues and eigenvectors of the co- 
variance matrix, respectively. The probability that the 
dipole should be included in the model, Pr(L), is then 
found by integrating over the volume of the parameter 
space obtained by expanding or contracting the error el- 
lipsoid such that it just encloses L. Then 



Pr(L) = erf(l/^V2) 



^exp(~l/2Ai^), 



where ^ = L^rr/l^l is the fraction of L enclosed by the 
error ellipsoid 

Lerr = [(L • Si)' + (L • S2)' + (L • 83)^^' 

and erf is the error function. 



Appendix B: Genetic algorithm for the wall model 

The standard procedure of minimising x^ to find best- 
fit model parameters is numerically difficult for the wall 
model ^ because x^ is discontinuous with respect to the 
model parameters: x^ changes in discrete steps depend- 
ing on whether any given quasar absorption system is on 
the Earth-side or far-side of the wall. This necessitates 
a genetic algorithm with simulated annealing to perform 
the fit. 

In a genetic algorithm, sets of model parameters are 
produced in 'generations' with those producing the best 
fit (in our cases the smallest x'^) in a particular generation 
retained to 'breed', producing the next brood of param- 
eters. In each generation, 'mutations' are introduced in 
order to produce the quasi-random variations of parame- 
ters necessary to cover the parameter space. The magni- 
tude of such 'mutations' are incrementally reduced in a 
process of simulated annealing so that the mutations be- 
come finer as parameters converge to their best fit values. 
As is usual when using genetic algorithms to determine 
best fits, several annealings are required. Unusually, in 
this case due to the discontinuous nature of the parame- 
ter space, we not only require several annealing steps but 
also several cycles of sets of annealing with new 'isolated' 
populations of parameters in order to avoid a situation 
where the entire population inhabits local minima in the 
parameter space. 

In each cycle, we perform 5 annealing steps, as it 
was found that subsequent steps produced limited ex- 
ploration of the parameter space due to the rapid con- 
vergence of the parameters, and that the re-initialising 
of genetic material in each cycle produced more efficient 



coverage of the parameter space. For this reason, only a 
single survivor is retained per cycle, to be bred with nine 
new randomly created populations. In each annealing 
step, 500 generations of model parameters are produced, 
with the variance scaled down with each generation. 

We retain 10 sets of model parameters {i} per gen- 
eration, with which to produce 500 offspring. We can 
think of these as "vectors" in the 4-dimensional param- 
eter space of the wall model. The offspring have genes 
taken to be the average of the parent values {i} and {j} 
with an additional random term: 



{child} 



W + W 



{C}-m-{j}) 



where the four terms in {^} are drawn randomly from 
a Gaussian distribution with standard deviation (Toxtra- 
The random scatter is reduced as the parameter values 
converge during the simulated annealing: 



'^cxtra — f^n 



500-5 

'' 500 



where g is the number of generations that have been run 
on a given annealing. The new generation of parame- 
ter values (parents and offspring) arc then sorted by the 
fitness parameter —x^ such that the 10 parameter sets 
producing the largest fitness parameters are retained. 

After sufficient cycles (5 was found to be adequate) the 
result is tested for convergence. The result is deemed to 
be converged if the algorithm has not produced a better 
result for four consecutive cycles. Otherwise, the cycles 
continue. This sacrifices speed of convergence for robust- 
ness of the result. 
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